Crashes, Recoveries, and 'Core-shifts' in a Model of Evolving Networks 



O 

o 

(N 

1—5 
l> 

o 
< 



> 

en 

o 

o 
o 

> 

X 



Sanjay Jain*'''''^-'^ and Sandeep Krishna*'^ 
* Centre for Theoretical Studies, Indian Institute of Science, Bangalore 560 012, India 

'^ Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA 

^ Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore 560 064, India 

^ Emails: jain@cts.iisc.ernet.in, sandeep@physics.iisc.ernet.in 

A model of an evolving network of interacting molecular species is shown to exhibit repeated 
rounds of crashes in which several species get rapidly depopulated, followed by recoveries. The 
network inevitably self-organizes into an autocatalytic structure, which consists of an irreducible 
'core' surrounded by a parasitic 'periphery'. Crashes typically occur when the existing autocatalytic 
set becomes fragile and suffers a 'core-shift', defined graph theoretically. The nature of the recovery 
after a crash, in particular the time of recovery, depends upon the organizational structure that 
survives the crash. The largest eigenvalue of the adjacency matrix of the graph is an important 
signal of network fragility or robustness. 
PACS numbers; 89.75.Fb, 87.23.Kg, 64.60. Cn, 05.65. -l-b 



The dynamics of crashes and recoveries has been the 
subject of several empirical and modeling studies in 
macroevolution (for reviews, see ]l|,0|) and finance ^^■ 
The main attempt of most models has been to repro- 
duce quantitatively the observed statistics of event sizes. 
However, it is also worthwhile to ask whether large events 
share other common features, or signatures, that precede 
the event or characterize the kind of systemic transforma- 
tion caused by them. Here we present a structural anal- 
ysis, based on network properties, of events that occur in 
a model H in which populations of molecular species co- 
evolve with their network of catalytic interactions [0^ . 
We find large crashes to be associated with a particular 
kind of structural change in the network, which we call 
a 'core-shift', and identify network characteristics that 
signal the system's susceptibility to crashes. This kind 
of analysis might be useful for other models of biological 
and social evolution. 

The system is a directed graph with s nodes labeled 
i G S = {1, 2, . . . , s}, represented by its adjacency ma- 
trix C = (cy). If there exists a directed link from node 
j to node i in the graph then Cy = 1, else Cij = 0. Each 
node represents a molecular species in a prebiotic pond 
and Cij = 1 means that j is a catalyst for the production 
of i. The dynamical variables are the 'relative popula- 
tion vector' of the species x = {{xi, . . . ,Xs)\0 < Xi < 
l,^^^j^ Xi — 1}, which is a fast variable, and the graph 
itself (or C), which is a slow variable. Initially, each Cy 
for i ^ j is independently chosen to be unity with a 
probability p and zero with a probability 1 — p. To ex- 
clude self-replicating species, ca = for all i. Each Xi is 
chosen randomly in [0, 1] and all Xi are rescaled so that 
J2i=i Xi — 1. With C fixed, x is evolved according to 
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until it reaches its attractor (always a fixed point P,1Q| ) , 
denoted X. dl^) is an idealization of rate equations for 
catalyzed reactions in a well stirred chemical reactor. 
The set £ of nodes with the least Xi is determined, i.e. 



C = {i G S\Xi — minj^sXj}. A node, denoted k, is 
picked randomly from C and for every i ^ k Cik and Cki 
are independently reassigned to unity with probability p 
and zero with probability 1— p, irrespective of their earlier 
values. This corresponds to removing the node k and all 
its links from the graph and replacing it by a new node k 
with random links to and from the other nodes. Ckk is set 
to zero, Xk is set to a small constant xq, all other Xi are 
perturbed by a small amount from their existing value 
Xi, and all Xi are rescaled so that X]i=i ^j = 1- This 
captures, in an idealized way, the impact of a periodic 
fluctuation like a tide or flood, which can wash out one 
of the least populated species in the pond (extremal se- 
lection [|l[), and bring in a new molecular species whose 
catalytic links with those in the pond are random (intro- 
duction of novelty). Then x is again evolved to its new 
attractor, another graph update is performed, and so on. 
Fig. 1 shows the number of populated species in the 
attractor (i.e., species with Xj > 0), si, as a function of 
time for three runs with different p values. Time is repre- 
sented by n, the number of graph updates. Three regimes 
or phases of behaviour can be observed. First, the 'ran- 
dom phase' in which si fluctuates about a low value. 
Second, the 'growth phase', when si shows a clear rising 
tendency (occasionally punctuated by drops) . Third, the 
'organized phase' where si stays close to its maximum 
value, s. The average time spent in each phase depends 
upon p and s. In this letter we investigate the large and 
sudden drops in si, visible in Fig. 1 (mentioned briefly in 
Q). These 'crashes' in the organized and growth phases 
are followed by 'recoveries', in which si rises on a cer- 
tain timescale. Fig. 2 shows the probability distribution 
P(Asi) of changes in the number of populated species, 
Asi{n) = si{n) — si{n—l). The asymmetry between rises 
and drops as well as fat tails in the distribution of fluc- 
tuations are evident. For low p the probability of large 
drops is an order of magnitude greater than intermediate 
size drops (also see Fig. 1) 
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An autocatalytic set (ACS) is said to be a set of species 
which contains a catalyst for each of its members [E3-p6| . 
Here it is a subgraph each of whose nodes has at least 



one incoming link from a node of the same subgraph. (A 
subgraph is a subset of nodes together with aU their mu- 
tual links.) For example in Fig. 3, the subgraph formed 
by nodes 40, 93, 36, 51 is not an ACS, but that formed 
by 40, 93, 36, 51, 63 is. The subgraph of all black nodes 
is also an ACS. Let M{C) = Xi be the largest eigenvalue 
of C. It can be shown ||l^ that (i) if the graph does not 
have an ACS then Ai = 0, and if it does then Ai > 1. (ii) 
X is an eigenvector of C with eigenvalue Ai. (Hi) The 
set of nodes for which X^ > is uniquely determined by 
C, independent of (generic) initial condition on x. (iv) If 
Ai > 1 the subgraph formed by the set in (iii) constitutes 
an ACS, which will be referred to as the 'dominant ACS'. 
(We find X from these algebraic properties, rather than 
numerically integrating (|l|).) 

In the random phase the graph has no ACS (in Fig. 
1 this phase coincides with Ai =0). The graph remains 
random because non ACS structures are not robust p2[ . 
This phase continues on average for a time r^ = l/p^s un- 
til at some graph update a small ACS appears by chance 
and the growth phase begins (in Fig. 1, Ai jumps from 
zero to one at that very time step, and in general, at 
the beginning of every growth phase). A small ACS is 
robust, [iv) implies that members of the ACS do well 
populationally compared to species outside it, hence the 
latter are replaced in subsequent graph updates. When 
the new species receives a link from the existing dominant 
ACS, the latter typically expands and si increases. This 
growth and self-organization continues over a timescale 
Tg In s where Tg — \/p until the dominant ACS spans 
the entire graph and Sx becomes equal to s (see Fig. 3) 
P, [lO[[l^ . That marks the beginning of the organized 
phase. Note that the entire graph in Fig. 3 is an ACS. In 
a fully spanned ACS the least populated species must be 
a member of the ACS. Now competition between mem- 
bers of the dominant ACS becomes important and can 
lead to fragility and rupture of the organization. 

Let us define a crash as a graph update event n for 
which Asi(n) < — s/2, i.e., an event in which a signifi- 
cant number (arbitrarily chosen as s/2) of the species go 
extinct. In runs with s = 100, p = 0.0025 totaling 1.55 
million iterations we observed 701 crashes. It is evident 
from Fig. 1 that crashes typically take place at or near 
Ai = 1. This can be understood by taking a closer look 
at the structure of the dominant ACS. 

The dominant ACS consists of a 'core' and a 'periph- 
ery'. The core of a dominant ACS is the maximal sub- 
graph, Q, from each of whose nodes all nodes of the dom- 
inant ACS can be reached along some directed path. The 
rest of the dominant ACS is its periphery. For an exam- 
ple see Fig. 3. When the dominant ACS consists of two 
or more disjoint subgraphs the above definition applies 
to each component separately Q. This distinction be- 
tween core and periphery is useful in the context of the 
above dynamics. For example, the ratios of Xi values 
of the core nodes are unchanged if any periphery node 
or link is removed from the dominant ACS, but remov- 
ing or adding any node or link to the core in general 



changes all Xi ratios. For any subgraph A define Ai(^) 
to be the largest eigenvalue of the submatrix of C cor- 
responding to A. Then, it can be shown that Ai(Q) is 
the same as the largest eigenvalue of the whole graph, 
Ai. The core (of each component) is an irreducible sub- 
graph (i.e., one which contains at least two nodes and a 
directed path from each of its nodes to each of its other 
nodes). It follows from the Perron- Frobenius theorem 
that if some links are added to the core (with possibly 
additional nodes) Ai increases, and if removed from the 
core, Ai decreases. Thus Ai is a measure of the core size 
and multiplicity of pathways or 'redundancy' within it. 
Ai = 1 corresponds to the case where the core (of every 
disjoint component of the dominant ACS) has exactly 
one cycle. Such a core has no internal redundancy; the 
removal of any link from it will cause the ACS property 
(of that component) to disappear. 

This is one, purely graph theoretical, reason why the 
organization is fragile in the vicinity of Ai = 1. An- 
other reason is dynamical: when Ai > 1 the core nodes 
are better protected against selection by virtue of their 
larger populations, whereas at Ai = 1 they are more vul- 
nerable. The reason is as follows: Since X is an eigen- 
vector of C with eigenvalue Ai, when Ai ^ it fol- 
lows that for nodes that belong to the dominant ACS, 
Xi = (1/Ai) ^ CijXj. In particular, if a node i of the 
dominant ACS has only one incoming link (from the node 
j, say) then Xi = ATj/Ai, i.e., Xi is 'attenuated' with re- 
spect to Xj by a factor Ai. The periphery of an ACS is 
a tree like structure emanating from the core, with most 
nodes having a single incoming link, for small p. Consider 
for example Fig. 3, in which the entire graph is an ACS 
with Ai = 1.31, and focus in particular on the chain of 
nodes 44 -> 45 -> 24 ^ 29 ^ 52 ^ 89 ^ 86 ^ 54 -> 78. 
The farther down such a chain a periphery node is, the 
lower is its Xi because of the cumulative attenuation. For 
such an ACS with Ai > 1 the 'leaves' of the periphery 
tree will typically be the species with least Xi (and node 
78 in Fig. 3 is one such). However, when Ai — 1 there is 
no attenuation. Periphery nodes will not have lower Xi 
than core nodes and some may have higher if they have 
more than one incoming link. Thus at Ai = 1 the core is 
not protected and in fact will always belong to C if the 
ACS spans the graph. Ai is known to be of significance 
in other complex systems as well p^-^. 

We now present evidence that crashes are indeed due 
to changes in the structure of the core. Define the core 
overlap, denoted Ov{C',C'), between two graphs C and 
C" (whose nodes are labeled) as the number of common 
links in the cores Q and Q' of their dominant ACSs (i.e., 
the number of ordered pairs of nodes (i,j) such that 
Qij and Q^ ■ are both non-zero.) If either C or C" does 
not have an ACS, Ov{C,C') is by definition zero. A 
graph update event at time n will be called a core-shift 
if Ov{Cn-i,Cn) = (C„ is the graph at time n). Fig. 4 
shows that most (612) of the 701 crashes were core-shifts. 
(If a crash is defined as an event in which more than 90% 



of the species become extinct, then there are 235 crashes 
in these runs of which 226 are core-shifts.) Of the re- 
maining 89 crashes 79 were 'partial core-shifts' and 10 
were events in which the core remained unchanged. 

In the 612 core-shifts, the average number of incom- 
ing plus outgoing links is 2.27 for all nodes in the graph, 
2.25 for the node that is hit and 1.25 for the new node. 
Thus the nodes whose exchange causes the crash are not 
excessively rich in links (and the hit node is always the 
least populated). 'Nondescript' nodes such as these cause 
system wide crashes because of their critical location in a 
small core (the average core size at the 612 core-shifts is 
6.3 nodes) that is responsible for the coherence and suste- 
nance of the whole network |g^ . Core-shifts in which the 
ACS is completely destroyed typically cause the largest 
damage (of 612 core-shifts these are 136 in number, with 
|Asi| = 98.2 ±1.2). The remaining 476 in which an ACS 
exists after the core-shift have |Asi| — 75.0 ± 14.2. The 
former constitute an increasing fraction of the crashes at 
smaller -p values, causing the upturn in P(Asi) at large 
negative Asi for small p (Fig. 2). 

Let Tj denote the time for which the system stays in 
the organized phase until a core-shift occurs. Tg increases 
with p but its quantitative dependence on p and s remains 
an open question. After a crash if there is no ACS the 
graph usually becomes a random graph in order s time 
steps. It takes on average Ta = l/p^s time steps before a 
new ACS forms g]. Once an ACS appears it grows expo- 
nentially across the graph on a time scale Tg = 1/p. After 
crashes in which an ACS survives the recovery time scale 
is just 1/p. The asymmetry between positive and neg- 
ative changes in si is a natural consequence of different 
processes being involved in the two cases. 

It is characteristic of natural evolution that as different 
structures arise in the system the nature of the selective 
pressure on existing structures, and hence their effective 
dynamics, changes. In the present system we likewise 
see an effectively random graph evolution when there is 
no ACS, a self-organizing growth phase when an ACS, 
a small cooperative and hence robust structure, arises, 
and competition within the ACS resulting in its even- 
tual fragility when a fully autocatalytic graph is formed. 
A different system in which the selective pressures and 
dynamics change as new structures arise is discussed in 
p3[ . Robust yet fragile structures also arise in highly 
designed systems Q. In the present model the appear- 
ance of different structures dynamically generates differ- 
ent time scales: r^ in the random phase, Tg in the growth 
phase, and the survival time of the core, r^, in the orga- 
nized phase. These multiple structures and timescales 
arise endogenously, i.e., they are all consequences of the 
same underlying dynamical rules. 
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Figure legends 

Figure 1. The number of populated species, si (con- 
tinuous line) , and the largest eigenvalue of C (whose sig- 
nificance is discussed later in the text), Ai (dotted line), 
versus time, n. The Ai values shown are 100 times the 
actual Ai value. Runs shown have s — 100, and (a) 
p = 0.001, (b) p = 0.0025 and (c) p = 0.005. 



Figure 2. Probability distribution of changes in the 
number of populated species. P(Asi) is the fraction of 
time steps in which si changes by an amount Asi in 
one time step in an ensemble of runs with s = 100 and 
p = 0.001,0.0025,0.005. Only time steps where an auto- 
catalytic set initially exists are counted. 
Figure 3. The structure of the graph at n = 2885 for 
the run in Fig. lb, when the dominant ACS spanned the 
entire graph for the first time. Node numbers i from 1 to 
100 are shown in the circles representing the nodes. Black 
circles correspond to nodes in the 'core' of the ACS, and 
grey to the 'periphery', defined in the text. 
Figure 4. Frequency, /, of core overlaps in crashes for 
runs with s = 100, p — 0.0025. The a;-axis displays the 
value of Ov{Cn-i, C„) in crashes (i.e., in the 701 events 
with Asi(n) < -50). 
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